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We use quadratic maximum- likelihood (QML) estimators to constrain models with Gaussian but 
statistically anisotropic Cosmic Microwave Background (CMB) fluctuations, using CMB maps with 
realistic sky-coverage and instrumental noise. This approach is optimal when the anisotropy is small, 
or when checking for consistency with isotropy We demonstrate the power of the QML approach 
by applying it to the WMAP data to constrain several models which modulate the observed CMB 
fluctuations to produce a statistically anisotropic sky We first constrain an empirically motivated 
spatial modulation of the observed CMB fluctuations, reproducing marginal evidence for a dipolar 
modulation pattern with amplitude 7% at / < 60, but demonstrate that the effect decreases at 
higher multipoles and is < 1% at / ^ 500. We also look for evidence of a direction-dependent 
primordial power spectrum, finding a very statistically significant quadrupole signal nearly aligned 
with the ecliptic plane; however we argue this anisotropy is largely contaminated by observational 
systematics. Finally, we constrain the anisotropy due to a spatial modulation of adiabatic and 
isocurvature primordial perturbations, and discuss the close relationship between anisotropy and 
non-Gaussianity estimators. 



I. INTRODUCTION 

The temperature fluctuations of the Cosmic Microwave 
Background (CMB) are often assumed to be a realization 
of a statistically isotropic Gaussian random field. Decom- 
posing the temperature fluctuations 6(^1) into harmonic 
coefficients 

Qira = J dnYi:^{n)e{n), (i) 

the covariance of the CMB is then given by Cim.i'm' = 
(O/mQ^'m') ~ ^ii'^mm'Cf^ ^ and the statistical properties 
of the fluctuations are completely described by the power 
spectrum Cf®. The assumption of statistical isotropy is 
well motivated theoretically, both as an application of the 
Copernican principle and as a prediction of more detailed 
cosmological models such as inflation. It is a central tenet 
of modern precision cosmology and the standard ACDM 
model, and as such needs to be rigorously tested. There 
are already tantalizing hints in the WMAP data for vi- 
olation of statistical isotropy, for example alignments of 
low multipoles [l]-[3], the axis of evil [4], power asymme- 
tries O |F , and the cold spot [7 . Many of these oddities 
have been discovered in the absence of a proposed model, 
and so the degree of a posteriori bias which they are sub- 
ject to is difficult to assess. 

In this paper, we will discuss estimators which can be 
used to constrain the parameters of many CMB models 
which are Gaussian but perturbatively anisotropic, such 
that the covariance contains small off-diagonal elements. 
There are many good reasons to consider such models. 
Secondary effects that are linear in the CMB temper- 
ature result in a guaranteed signal of this form at the 
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O(10~^) level: gravitational lensing [8^, patchy reioniza- 
tion [9 and the doppler shifting due to the motion of our 
frame relative to the CMB, for example, may be thought 
of in this context and should become observable with 
the upcoming generation of CMB measurements. Many 
non-linear effects and non-standard models can be con- 
sidered as a fixed modulation of an initially Gaussian 
field, which is still Gaussian but anisotropic if the mod- 
ulation is considered fixed. More speculatively, recently 
proposed anisotropic models of inflation could lead to 
primordial fluctuations that are anisotropic [TOl [11] . A 
non-zero primordial bispectrum can also be considered in 
this framework, where there is a power anisotropy cor- 
related to the temperature, although the current data 
indicates that any non-Gaussianity consistent with the 
expectations from inflationary models is small. Models 
of modulated pre- heating [12], inflationary bubble colli- 
sions (e.g. [13 ), topological defects (e.g. [14 ), and vari- 
ous other interesting scenarios could also give observable 
effects that may show up in simple anisotropy estima- 
tors motivated by Gaussian anisotropic models, although 
the signature would typically not be Gaussian in these 
cases. In the case that the CMB is actually isotropic, 
the estimators which we will discuss simply place op- 
timal, minimum-variance constraints on the degree of 
anisotropy, and hence provide valuable tests of consis- 
tency and systematics. 

Assuming that the instrumental noise is also Gaus- 
sian, the rigorous approach to an analysis of these mod- 
els is clear: calculate the log-likelihood C of the observed 
CMB, given by 

- /:(0|h) = i0t(C®®)-i0 + ^ lndet(C®®), (2) 

where h are parameters characterizing the anisotropy, 
is the observed CMB and C®® = C®® + C^^ is 
its covariance, incorporating the theoretical (anisotropic) 
covariance as well as instrumental noise. For modern 
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datasets with hundreds of thousands of modes, the fuh 
covariance matrix C®® is unmanageably large. To work 
with it directly therefore requires either artificial limita- 
tion of the analysis to some subset of the data [15] |T6| , or 
exploitable sparseness [17J. In this work, we will take a 
quadratic estimator approach, expanding the likelihood 
to low order in the anisotropy. This approach is not a new 
one: it was originally used for the purpose of lens recon- 
struction by Ref. [18 , and many of the estimators that 
we will discuss here have also been derived for full-sky 
coverage as minimum- variance estimators with quadratic 
form [9l rr9ti22] . In this paper we will generalize these es- 
timators for application to real data, and show that in 
special cases where it is actually feasible to perform the 
more computationally intensive exact-likelihood analysis 
the quadratic approach produces effectively identical re- 
sults. 

Although we will frame our discussion here on Gaus- 
sian but statistically anisotropic models, we note that ev- 
ery statistically anisotropic Gaussian model is related to 
a statistically isotropic but non-Gaussian model: if there 
is a preferred direction, taking the direction as being a 
random variable (e.g. by a random rotation) makes the 
distribution statistically isotropic at the expense of com- 
plicating the statistics. In the statistically isotropic in- 
terpretation the anisotropy estimators we discuss would 
always have zero expectation, but the non-zero discon- 
nected four-point function gives the estimators a vari- 
ance above that expected for a Gaussian isotropic field, 
giving an equivalent means of detection. This interpre- 
tation is particularly useful for e.g. lens reconstruction 
and patchy reionization, where the particular realization 
of the anisotropy, as constrained by the likelihood esti- 
mator, is sometimes of less interest than its statistics, 
contained in the power spectrum of the estimates. 

We shall focus on the CMB temperature, since this is 
measured with much lower noise than the polarization, 
especially on smaller scales. However if there is signif- 
icant evidence for anisotropy, polarization information 
would ultimately be an excellent way to further constrain 
the origin of the signal: an anisotropic power spectrum at 
recombination for example, should give a consistent sig- 
nal in the polarization, but if the signal is local - e.g. due 
to an unknown foreground or instrumental systematic - 
the polarization signal could be quite different. The gen- 
eralization of these estimators to the polarized case is in 
principle straightforward. 



II. ANISOTROPY ESTIMATORS 

We begin by introducing the methodology of 
anisotropy estimation as a likelihood maximization, 
loosely following Hirata and Seljak who pioneered this 
approach for CMB lensing [18]. 

Differentiating the likelihood of Eq. Q with respect to 
a set of parameters h which characterize the anisotropy 
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The trace term results in a "mean field" over realizations 
of the observed CMB. To see this, consider the identity 
Tr(A) = {xtj4C~-'x), where A is any matrix and x is a 
vector of Gaussian random variables with covariance C . 
Making this substitution with C = and maximizing 
the likelihood by setting 5C/dh^ = gives the simple 
equation 



5C_ 
(5ht 
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where 
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The maximum-likelihood (M-L) point can be determined 
iteratively using Newton's method 
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where quantities subscripted with i are evaluated for the 
estimate of the i^^ iteration. Note that Eq. (|6| is 
a very general way to maximize a Gaussian likelihood 
when the covariance is perturbatively linear in a set of 
parameters. It leads to the power spectrum estimator of 
Ref. [23 , for example, if one takes the h parameters to 
encode the CMB power spectrum. We shall assume that 
the noise and normal cosmological parameters are known, 
so we are only maximizing the anisotropy parameters; 
if the assumed parameters (and hence isotropic power 
spectrum) are incorrect this could lead to small biases in 
the estimator. 

We are working under the assumption that any 
anisotropy which we will be studying is "weak", and so 
a single iteration of Eq. (|6|, starting from h = should 
give a sufficiently accurate estimate of h. For simplicity, 
the derivative term is replaced with its ensemble average 
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where the equality between the second and third lines 
is dictated by the normalization of the likelihood and T 
is the Fisher matrix, considering the likelihood to be a 
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function of the parameters h. Putting all of this together, 
we have an approximate, quadratic maximum-likelihood 
(QML) estimator of the form 



h = jp-i[h-(h)]. 



(8) 



The inverse Fisher matrix can be thought of as the esti- 
mator normalization, as well as its covariance in the limit 
of no anisotropy. The quadratic part of the estimator is 
given by 



Ho 



E 

Imd'm' 



^^?m, I'm' 

(5ht 



^Im^l'm' 



(9) 



where = {C®^)~^\q& is the observed sky after ap- 
plication of inverse- variance filtering with h = 0. In 
the limit of weak anisotropy this QML estimator satu- 
rates the Cramer-Rao inequality, and so is optimal in 
the minimum- variance sense. In practice, "weak" means 
non-detection, and so this form of quadratic estimator is 
excellent for testing statistical isotropy, but needs to be 
treated with care if a significant detection is made (as 
is soon expected to be the case, for example, with CMB 
lensing). In this work we have occasion to compare some 
of our results to exact likelihood calculations by other 
authors, and find good agreement. 

Thus, the QML formalism makes it straightforward to 
construct estimators for any form of Gaussian but sta- 
tistically anisotropic model which is accurately parame- 
terized as a linear function of a set of parameters h. To 
assess the significance of a possible detection, a useful 
statistic is the value of the estimate, given by 



X 
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For n constrained parameters, the measured value can be 
compared to the cumulative distribution function (CDF) 
of the Xn distribution to produce an intuitive "p-value" 
figure of merit, which for this paper we will define to be 
the probability of attaining a greater value of 
isotropic model. 

There are several advantages of the quadratic ap- 
proach: 

Speed: Provided that an inverse variance filtered sky-map 
has already been calculated, the evaluation of the h terms 
is at most O(^max)- practice, the QML approach often 
results in estimators which have a fast implementation 
in real space, making them 0(/max)- Simulations are 
often required to determine the mean-field and estimator 
normalization, but the number of these is generally less 
than are needed for an exploration of the likelihood. 

In many situations, QML estimators also enable the 
study of a much larger model space than could conceiv- 
ably be explored with exact likelihood evaluations. For 
example, the modulation anisotropy models which we 



will use in the following sections result in QML estima- 
tors which produce an entire modulation sky-map, rather 
than just estimating a small number of its coefficients as 
has been done in previous exact likelihood analyses. This 
can help to identify the source of contaminating signals. 

Systematic and jacknife tests: The quadratic estimator 
approach is amenable to certain systematic and jacknife 
tests for which there are no analogues in an exact like- 
lihood exploration. The effects of correlated noise, for 
example, can be avoided straightforwardly by forming 
the anisotropy estimates from cross-correlations between 
maps with different noise realizations. In this case, the 
likelihood approach which was used to derive our estima- 
tor is invalid. In particular, the estimator normalization 
and the reconstruction covariance are no longer equal. 
For full-sky coverage and homogeneous noise they may 
be derived using a minimum-variance approach, similar 
to that which is used in Ref. [24]. For the purpose of 
significance testing however, it is irrelevant whether the 
estimates are properly normalized, only that their covari- 
ance is accurately characterized. The value defined in 
Eq. (10), for example, is still a valid measure of signifi- 
cance, even though the inverse Fisher matrix no longer 
properly normalizes the estimates. 

The quadratic approach also enables the use of simple 
jacknife tests in harmonic space. The sum of Eq. (|9|, 
for example, can be broken up into contributions from 
different multipole ranges at no additional computational 
cost. Provided that the inverse variance filter does not 
significantly mix the modes of the observed CMB across 
the chosen multipole ranges, this can be used to construct 
anisotropy estimators in bands, and localize the origin of 
any detected anisotropy in harmonic space. 

In the quadratic approach the treatment of additional 
Gaussian but anisotropic systematics is also clear. Pro- 
vided that such effects can be incorporated into the sim- 
ulation and inverse-variance filtering steps they will be 
subtracted from the anisotropy estimates and their addi- 
tional contribution to the variance of the estimates will 
be included in J^~^. The effect of beam asymmetries and 
sidelobe pickup, for example, fall neatly into this cate- 
gory, though a method for correcting the contaminating 
signals could be better in practice. 

We now proceed to apply the QML formalism to sev- 
eral anisotropic modulation models, which we constrain 
using the WMAP data. 



III. IMPLEMENTATION 

We will test the assumption of statistical anisotropy in 
the WMAP 5-year maps, which are provided in HEALPix 
format at A^side = 512 [25 . Specifically, we will study the 
Q-, V- and W-band data^at 40, 60 and 90Ghz. We limit 
our analysis of the Q-band data to /max = 300, as its 
power spectrum becomes clearly contaminated by point 



4 



sources at higher multipoles, which we do not attempt 
to model. The mean field and estimator normalization 
are generally determined for each data map using Monte- 
Carlo simulations, although we will sometimes use ana- 
lytic approximations to the normalization when this is 
not possible. 

For the high-resolution WMAP data, the inverse- 
variance filtering operation is too costly to perform di- 
rectly. Instead, we use the conjugate gradients technique 
to iteratively solve [26j 



[{C 



Y_^N-^Y]C^®S = Y'^N-^S. (11) 



Here C®® is the intrinsic CMB covariance, diagonal in 
harmonic space in the limit of no anisotropy. Throughout 
this work, we will use a fixed fiat ACDM cosmology for 
C®®, with standard parameters {rt^^Qc^ h^ris^r^ As} = 
{0.05, 0.23, 0.7, 0.96, 0.08, 2.4 x 10"^}, which is consistent 
with the WMAP5 best-fit power spectrum [27 . The data 
itself is contained in the map vector and Yi is the 
pointing matrix, which provides the mapping between a 
harmonic space signal and the observed map: 







(12) 



where Bi is the transfer function for the beam, which 
is assumed to be symmetric. Finally, is the noise 

model, which we take in real space to be 



N- 



- N;^T^[T^NrlT]-'TN-^. (13) 



Here ATpT^ is the covariance matrix of the map noise, 
which is taken to be uncorrelated between pixels. To 
effect a sky-cut, ATpT^ is taken to be zero for masked 
pixels. T is an ntmpi. x^pix matrix of template maps to be 
projected out of the data. Unless otherwise noted, we use 
this only to project out the four templates corresponding 
to monopole and dipole modes. 

Within the conjugate gradients approach, the solution 
of Eq. (11) requires careful preconditioning to be fast 



enough for the large number of Monte-Carlo simulations 
which we perform. We implement the multigrid precon- 
ditioner given by Ref. [28 , which is the fastest to date. 
For a fractional error of less than O{10~^) in each mode 
of the calculated field, we find that our implementation 
has a typical cost of ten minutes on a 2GHz processor, 
evaluated to /max = 1000 for WMAP noise with a galactic 
cut. 



IV. RESULTS 

A. Modulation on the sky 

A popular form of modulation anisotropy which has 
been tested in the literature is given by 



where O ^ (n) is some intrinsic statistically isotropic CMB 
temperature, /(n) is a modulating field, and the / sub- 
script denotes restriction to some range of angular scales 
(e.g. / < /max)- The dipole part of /(n) has received par- 
ticular attention since the work of Ref. [29 , which found 
evidence for a large-scale hemispherical power asymme- 
try in the WMAP data. This pathfinding work was fol- 
lowed by more rigorous likelihood analyses [15, 30, 3T1, 
and recently Refs. [16] have extended the analysis to 
smaller angular scales, arguing for increased detection 
significance as more data is added. The exact likelihood 
analysis is currently limited by computational require- 
ments to multipoles / < 80, but with the QML approxi- 
mation to the maximum- likelihood estimator no such dif- 
ficulties arise, so we can extend our analysis to the limit 
of current observations. 

We note that the doppler effect giving the CMB kine- 
matic dipole also results in a dipole modulation of the 
assumed form, with /(n) = /3 • n. Assuming a small 
O(10~^) primordial dipole, /3 is expected to have am- 
plitude 0.0012 in the direction (/,6) = (260^,50°) of the 
observed total dipole. This signal should be present on 
all scales along with an aberration effect which is de- 
generate with a CMB lensing dipole. Although we will 
see that this effect is an order of magnitude too small 
to be observed with WMAP, it should be detected with 
high significance in the data of the recently launched 
Planck satellite [32]. Any robust signal detected at a 
higher level (amplitude ^ 0.001) would be an indica- 
tor of cosmological statistical anisotropy, or equivalently 
non-Gaussianity. 

The simplest quadratic estimator for the modulation 
is just - an estimator of the temperature variance in 
each pixel. In a statistically isotropic model a map of 
0^ should on average have only a monopole, and there 
should be no structure which is not consistent with noise 
or CMB fiuctuations. The QML estimator that we derive 
essentially generalizes this to optimally account for noise 
and cosmic variance. 



The temperature moments of Eq. ( 14 ) for the modula- 
tion anisotropy are given by 
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where /, V are restricted to the range of / under consid- 
eration. To first order in //^, the resulting covariance is 
therefore 



Climi,l2m2 — (®^imi0^2m2) 

= ^hl2^mim2Ch^^^ flm[Cl-^^Ci^] J d9YimYi\^Yi^rn2- 
Im 

(16) 



e/(n) = [l + /(n)]e)(n), 



(14) Using Eq. ([9| results in a QML estimator constructed 
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On the full sky the expected value depends only on the 
off-diagonal part of the underlying temperature realiza- 
tion covariance matrix, and is essentially an optimally 
weighted combination of the bipolar spherical harmonic 
coefficients introduced in Ref. [33 . Here we have written 
it in a form convenient for fast numerical evaluation in 
terms of real-space fields; in harmonic space it can be 
written as 
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Due to the triangle constraint |/i — /2I ^ I ^ |^i + ^2! 
from the 3-jm symbols, and the constraint that / + /i + ^2 
is even, we see that the dipole part of is due to the 
off-diagonal /i, /i + 1 part of the covariance, and similarly 
the quadrupole part depends on the and +2 

correlations. Note that the estimator is independent of 
the modulation pattern, amplitude and orientation, so 
the estimator only needs to be calculated once to recover 
all multipoles — there is no need to marginalize over all 
the possible orientations. 

The Fisher matrix is determined by the trispectrum of 
the inverse- variance filtered CMB. The CMB is assumed 
to be Gaussian and so this consists only of disconnected 
terms. A useful tool for forecasting is the portion of the 
Fisher matrix due to the isotropic terms, i.e. those in 
which any coupling between modes of the observed CMB 
are taken to be zero (i.e. we ignore the non-diagonal 
elements of C®® so ® ^ SwCj;''^). This is given by 
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Tff 



The corresponding estimator noise we denote as Ni 

practice, for small sky-cuts and roughly ho- 
mogeneous uncorrelated instrumental noise this is quite 
a good approximation to the true covariance. For full- 
sky coverage and homogeneous noise, it is exact. For a 
cosmic-variance limited reconstruction, N^-^ scales with 
the number of observed modes, as l^^^ at high-L For re- 
construction of the modulation dipole /i^, for example, 
we find - 6.24/-2^. 

In Fig. [1] we show the pseudo-C/ power spectrum of 
a typical fim reconstruction. For simplicity we use the 
isotropic normalization for this plot, with confidence in- 
tervals determined by Monte-Carlo simulation. The re- 
construction variance compares very well to the full-sky 
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FIG. 1: Pseudo-G of the fim reconstruction for the WMAP 
V-band foreground-reduced data, with KQ85 mask and 
/max = 64 (black solid). The full-sky, isotropic normalization 
was used rather than the actual inverse Fisher matrix. The 
[25,75]% (dark gray) and [5,95]% (light gray) confidence in- 
tervals measured from Monte- Carlo simulations are overlaid. 



analytical result, scaled by the unmasked sky-fraction. 
In Fig. [2] we show maps of the reconstructed /(n) for 
/max = 25, 64, 100, smoothed with a ten degree beam. 
The modulation reconstruction brings the hot and cold 
spots of the observed GMB into sharp relief. The GMB 
cold spot [34 , for example, is distinctly visible in all three 
maps. 

It can be seen from Fig. [l] that the amplitude of the 
reconstructed modulation is generally in good agreement 
with that expected from our AGDM simulations, except 
for the dipole which is slightly high. To make connection 
with previous work, we will now focus exclusively on the 
dipole modulation, restricting our reconstruction to the 
three modes of the dipole, for which we determine the full 
Fisher matrix by Monte- Gar lo. The three dipole modes 
may be simply related to the amplitude and galactic co- 
ordinates of a dot product modulation /(n) = A • n. In 
the upper panel of Fig. [3] we show estimates of this mod- 
ulation amplitude | A| for the WMAP data, as well as the 
expectation value (due to the estimator noise) for a full- 
sky cosmic variance limited measurement. The observed 
amplitudes generally lie above the full-sky expectation. 
This is partially due to the presence of instrumental noise 
and a sky-cut, particularly at high-/ where the WMAP 
data becomes noise dominated. In the lower panel of 
Fig. [3] we plot the analytical significance levels of the 
values calculated following Eq. (10), given three degrees 



of freedom. We can see that all of the observed p- values 
are somewhat low, typically at the five percent signifi- 
cance level. Interestingly, the probability of the observed 
fim in an isotropic model is smallest at /max = 40, 64, 
which are the two most previously studied values. 

We note that in their region of overlap, our measure- 
ments agree well with those of Ref. [16 . Our measured 
(A, b) values for /max < 100 are all within error of those 
which they quote, and consistent with a preferred direc- 
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FIG. 2: Reconstructed maps of /(n) for three values of /max, 
smoothed with a ten degree beam. We have used the isotropic 
normahzation for simphcity, which is invaUd close to the sky- 
cut, but works well otherwise. The '+' symbols mark the 
peak of the QML dipole. The 'x' symbol and ring in the 
/max = 64 plot marks the M-L dipole and error found by [16], 
which agrees well with our result. 
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FIG. 3: Summary of modulation dipole results for the 
foreground-reduced WMAP data. Solid lines correspond to 
KQ85 masking, and dashed lines use the KQ75 mask. Upper 
panel: Dipole amplitudes |A| as a function of the maximum 
multipole used in the reconstruction. The black dashed line 
gives the expected value for a cosmic- variance limited experi- 
ment, which is non-zero due to the estimator noise. The dot- 
ted lines give the reconstruction noise spectra measured from 
the simulations. They separate into two groups for KQ85 and 
KQ75 masking and are well described as f~^^ times the ideal 
result for /max < 300, but decrease more slowly at higher-/ as 
the instrumental noise becomes non- negligible. Lower panel: 
significances of the reconstructions in the isotropic model. 



tion of (/,6) = (225°, -22°) ± 24°. Their quoted signifi- 
cance values also agree well with the square-root of our 
values, although we note that for the purposes of test- 
ing ACDM, they should be treated as Xs significances, 
rather than Gaussian ones. Fig. [3] shows that the large 
modulation indicated by the low-/ data does not persist 
on smaller scales however, consistent with the tight con- 
straint on the anisotropy in the quasar distribution [35] . 
There is still some tension between these measurements 
of fim and an isotropic model. Although chance is a pos- 
sible explanation, it is intriguing that the observed am- 
plitude of dipole modulation is consistently high across a 
large range of multipole values (although it must be re- 
membered that the measurement of A is cumulative, and 
so the estimators at different /max can be strongly cor- 



related). To be rigorous, we consider here several other 
possible explanations for this tension: 

The cold spot: In Fig. [2] we have seen that the CMB "Cold 
Spot" [34 constitutes a prominent feature in the recon- 
structed modulation field. It is also close to the dipole 
of the reconstruction. It is possible that the large dipole 
amplitude is simply another detection of the Cold Spot. 
To test this, we perform the modulation reconstruction 
with a new mask which we call KQ85+CS5, created by 
augmenting the KQ85 mask with a circular cut of radius 
five degrees centered at (/, 6) = (208°, —56°). The results 
of this reanalysis are given in Fig. |4] The removal of the 
Cold Spot has a large effect for the /max = 25 reconstruc- 
tion, but does not significantly effect the reconstructed 
modulation at higher multipoles. 
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FIG. 4: Sensitivity of modulation results to various tests, sim- 
ilar to Fig. [3] All data are for WMAP V-band. Blue solid is 
the V-band foreground-reduced data, with KQ85 mask. Ma- 
genta is with the KQ85+CS5 mask. Green is for reconstruc- 
tions with /min = 20. Red lines are for raw maps, without 
template cleaning (solid/dashed correspond to KQ85/KQ75 
masking respectively.) 



Residual low-l anomalies: The modulation model may 
be motivated as a way to consistently treat the observed 
low-/ power deficit, asymmetry, and alignment issues [30] . 
To what extent do the low-/ anomalies imprint upon our 
measurements of fim for larger values of /max? We test 
this straightforwardly by excluding all of the filtered mul- 
tipoles below some /min from our analysis. We choose 
/min = 20, which ensures that any mixing of the anoma- 
lies at / < 10 due to the sky-cut will have negligible effects 
on the reconstruction. This is shown in Fig. |4] Again, 
this excision does not result in any significant reduction 
of the measured modulation. 

Foregrounds: The consistency of the modulation ampli- 
tudes with frequency band and sky-cut makes a fore- 
ground explanation for the observed tension seem un- 
likely. We also confirm that our results are not strongly 
dependent on the method of foreground subtraction, by 
marginalizing over the WMAP foreground templates [SG] 
in the inverse- variance filtering operation, similar to [15]. 
Further investigation raises two interesting points, how- 
ever. To understand the order-of-magnitude for poten- 
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FIG. 5: v^/(/ + l)/2(/2 + l)Cii^/2ii for a{k) = 1, with h = / 
(thick solid), I2 = I (dashed), /2 = / + 4 (dot-dashed) and 
/2 = / + 20 (dotted). 



tial foreground effects, we measure the modulation ampli- 
tudes for raw WMAP data, without foreground template 
subtraction. This is plotted in Fig. [4| We can see that 
complete neglect of foregrounds has a large effect on the 
estimated modulation. Interestingly, this increase is not 
strongly dependent on choice of sky-cut, indicating that 
consistency under changes in the mask size does not rep- 
resent strong evidence against foreground contamination. 
Additionally interesting is that the modulation dipole in- 
duced by foregrounds is closely aligned with the dipole 
determined for cleaned maps. In the quadratic estima- 
tor approach, an additive template which is uncorrelated 
with the primary CMB has an additive effect on the re- 
constructed modulation. The V-band template model, 
for example, gives a contribution to the dipole of an 
^max = 40 analysis in the direction (/,6) = (208°, -10°), 
due in roughly equal parts to the contribution from the 
Ha and Finkbeiner dust templates. All of these results 
are of course for the current foreground model, which is 
projected out of the data. It seems possible, however, 
that residual foreground contributions which are mor- 
phologically similar may source the tension between the 
modulation estimates and an isotropic model. 



B. Primordial Power Anisotropy 

Refs. [lOl [TTJ [17] consider anisotropic models of 
the early universe, where at late times the universe 
isotropizes so that the only evidence is an angular- 
dependent power spectrum on large scales (but isotropic 
transfer functions). The isotropic real adiabatic transfer 
functions Ai{k) are defined so that 

—Ai{k)xoimi*M (19) 
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where Xo(k) is the (statisticahy anisotropic) primordial 
perturbation with power spectrum defined by 



(Xo(k)xS(k')) = (27r)35(k-k')Px(k) 

27r2 

= (27rfS{k-k')^V^{k). (20) 
The covariance is then given by [10] 

C'limil2m2 ~ 



d^kP^(k) A,, {k)Ai, {k)Y,:^^ {k)Yi,m2 (k) . 



(21) 



The anisotropy is contained in the direction-dependent 
P^(k), and we can construct a QML estimator for the 
angular dependence of the power spectrum at a given k 
from Eq. ([9| using 



TT 



dnY,i 



Im 



J2 i^'Ai,{k)ei,^,Yi,rm 

.hmi 



.l2'm2 



(22) 



Refs. [m [171 [371 138] consider the particularly simple case 
where the /c-dependence is in a known function a{k) so 
that 



V^ik)=r^{k)[l+a{k)gik)], 



(23) 



and ^(k) = ^(— k) so gim has only even-/ modes. The 
covariance in this case is 



Climil2m2 — ^I\l2^rn\m2^li 



(24) 



where 



Ci,i, =47r J dlnkV^{k)a{k)Ai^{k)Ai,{k). (25) 
The QML estimator for g is therefore constructed from 



hi'. 



(26) 



The i^i~^2 term forces the reconstruction to zero for odd- 
/. Similar quadratic estimators have been derived be- 
fore on the full sky [21] . For a pixelization such as 
HEALPix which consists of (9(/max) isolatitude rings, 
the azimuthal sums may be evaluated using FFTs in 
^(^max log ^max)- To estimate gim to some maximum 
multipole A^, the Ci-^i^ matrix may be treated as band- 
diagonal with A^ bands, due to the triangle constraint on 



the integral of three spherical harmonics. This leads to a 
total cost of 0{Ail^^^ log /max)- An alternative approach 
is to discretize the integral of Eq. (25), in which case g 



may be evaluated as Uk spherical harmonic transforms, 
where Uk is the number of quadrature points for the inte- 
gral, leading to a cost of (^(^fc/max)- A similar technique 
was applied in Ref. [39 . 

The disconnected, isotropic component of the Fisher 
matrix in this case is given by 



^ 8n 1 



^tot^tot * 
h h 



(27) 



Again we take = [^solzT^- -"-^ ^^S- [s] we show the 
pseudo-C/ power spectra for the reconstructed gim^ for 
a{k) = 1 and a{k) = The a{k) = k~'^ results are 

generally in good agreement with Monte-Carlo expecta- 
tions. The a{k) = 1 reconstruction, on the other hand, 
has an anomalously large quadrupole. As with the mod- 
ulation dipole of the previous section, we investigate this 
more thoroughly by normalizing the estimates with the 
quadrupole Fisher matrix, estimated from simulations. 
In Fig. |8] we show the reconstructed quadrupole for the 
V-band data. Groeneboom and Eriksen [17] have per- 
formed a Bayesian analysis of the primordial modula- 
tion quadrupole, assuming the form ^(n) = ^*(k • n)^. 
They find a maximum-likelihood quadrupole described 
by ^* = 0.10 with k along (/, b) = (130°, 10°). Their anal- 
ysis was based on the published version of Ref. [11 which 
neglected the i^i~^2 prefactor in the covariance; our anal- 
ysis incorporates two additional degrees of freedom and 
includes the correct prefactor. The results are nonethe- 
less quite similar in magnitude and direction, though we 
find that the preferred direction lies closer to the ecliptic 
poles. 

The Ci of the QML quadrupole reconstructions are 
given in the top panel of Fig. [7| for the Q-, V- and 
W-band foreground-reduced datasets with a variety of 
cutoff /max i^ the /i,/2 sums of Eq. [26] In the lower 
panel are given the significances (calculated following 
Eq. 10), for a distribution with five degrees of freedom. 
At /max ^150 consistent results are seen in the Q-, V-, 
and W-band for all three datasets, independent of the 
choice of KQ75 or KQ85 masking, roughly tracing the 
amplitude of the estimator noise and consistent with no 
detection of anisotropy. Including data from higher mul- 
tipoles, a highly significant excess power develops. The 
increasing discrepancy between the V- and W-band data 
at /max > 200 seems indicative of systematic contamina- 
tion rather than a primordial origin. To gain a better 
understanding of the required systematic, we use the al- 
gorithm described in Appendix [A] to generate a CMB 
realization which contains the observed modulation as 
the sum of anisotropic and purely isotropic parts. This 
is presented in the lower two panels of Fig. [S] The map 
of the anisotropic part gives a measure of the magni- 
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FIG. 6: Pseudo-Ci of the gim reconstruction for the 
WMAP V-band foreground-reduced data, with KQ85 mask 
and /max = 400 (black sohd). The full-sky, isotropic nor- 
malization was used. The [25,75]% (dark gray) and [5,95]% 
(light gray) confidence intervals measured from Monte-Carlo 
simulations are overlaid. 



tude and spatial dependence of the systematic required 
to mimic the observed signal. The close alignment of 
the anomalous quadrupole modulation and the ecliptic 
seen in Fig. [S] suggests a systematic associated with the 
WMAP scan strategy. The two most suspicious possi- 
bilities are correlated noise and beam asymmetry effects, 
which have not been accounted for in this analysis, nor 
included in the Monte-Carlo simulations. 

Ref. [17] have already partially considered the effect of 
correlated noise on their results. Working with end-to- 
end simulations produced by the WMAP team for the 1- 
year data release, they find large effects for the W-band, 
although with qualitative structure different from that 
which is actually found in the data. In the quadratic- 
estimator approach, such possible contamination can be 
avoided by cross-correlating maps with different noise re- 
alizations, as discussed in Sec. [TTj Cross-correlating data 
for the individual differencing assemblies (D/As) of the 
V- and W-band data, we find that the anomalously large 
quadrupole power persists, further indicating that corre- 
lated noise is not a likely explanation for the observed 
excess. 



FIG. 7: Summary of results for ^'(n) quadrupole reconstruc- 
tions with a{k) = 1 for the foreground-reduced WMAP data. 
Solid lines correspond to KQ85 masking, and dashed lines use 
the KQ75 mask. Upper panel: Quadrupole power as a func- 
tion of the maximum multipole used in the reconstruction. 
The black dashed line gives N^^, the reconstruction noise for 
a cosmic- variance limited full-sky experiment. The dotted 
lines give the reconstruction noise measured from the simu- 
lations. They separate into two groups for KQ85 and KQ75 
masking and are well described as f~^^ • N^^ for /max < 300, 
but decrease more slowly at higher-/ as the instrumental noise 
becomes non-negligible. Lower panel: significances of the 
reconstructions in the isotropic model. 



The effect of beam asymmetries is more involved, how- 
ever a sample of the expected beam anisotropy effects can 
be found in Fig. 11 of Ref. [28 and Fig. 5 of gO]. The 
1/iK RMS contribution of beam asymmetries to the ob- 
served sky is aligned with the ecliptic and matches well 
with the amplitude in our Fig. [8} making beam asymme- 
tries a potential explanation for the observed anisotropy. 
To investigate this further, we analyze simulations of 
the WMAP data released by Ref. [40 , who publish 10 
maps of simulated beam-convolved skies for each of the 
WMAP D/As. The distribution of the resulting val- 
ues is given in Fig. |9] The effect of asymmetric beams 
is seen to be large. For the VI and W4 D/As, it com- 
pletely explains the 0(0.1%) significance of the observed 
quadrupole. We therefore conclude that current measure- 
ments of the power modulation quadrupole are strongly 
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A full analysis with beam asymmetries is beyond the 
scope of this paper, however we note that in the QML 
approach beam asymmetry effects can simply be incorpo- 
rated into the simulation pipeline. They will then appear 
as a contribution to the "mean field" term, and be sub- 
tracted from the reconstruction. In principle it is neces- 
sary to include the correlation due to beam asymmetries 
in the inverse variance filter, which is too computation- 
ally expensive to perform in general. If the instrumental 
noise can be approximated as white on the timescales 
which separate pixel visits, however, then the fast algo- 
rithm presented by Ref. [28 can be used for the forward 
convolution operation, which should only slow the appli- 
cation of the inverse variance filter by a constant factor 
of (9(20). Alternatively one could attempt to correct the 
maps for the beam asymmetries, for example by estimat- 
ing the anisotropic contribution by forward convolutions 
of the observed sky, then iteratively subtracting off the 
part due to beam asymmetries. 



-300 



300 



-4 



FIG. 8: Upper panel: QML reconstruction of the g{h) primor- 
dial power quadrupole, for the WMAP V-band foreground- 
reduced data to /max = 400. Also shown is the preferred 
quadrupole direction of (/, b) = (130°, 10°) found by Ref. [17 
('x^), using incorrect factor) and the ecliptic north pole 

at {l,b) = (96°, 30°) {^o'). Lower two panels: Isotropic and 
anisotropic components of a CMB simulation with g2m given 
by the QML reconstruction, smoothed with a WMAP V-band 
beam. This gives an intuitive understanding of the effects in- 
duced by this form of anisotropy. The RMS deviations of 
the isotropic and anisotropic components are 90 /j^K and l/j^K 
respectively. 



contaminated by beam asymmetry effects, which must be 
corrected for to obtain true constraints on any primor- 
dial modulation. That the signal strongly varies between 
the D/As indicates either that the simulation of Ref. [40] 
are not encapsulating all of the relevant beam effects, 
or that there is an additional unknown systematic. In 
any case the significant variations between D/As at the 
same frequency provide strong evidence that the signal is 
systematic rather than primordial or foregrounds; in all 
cases the preferred direction is close to the ecliptic. 



C. Local primordial modulation 

Finally, we consider the case where the primordial per- 
turbations Xo(x) are modulated in real space, so that the 
primordial perturbation field is 



X(x) = xo(x)[l + (/)(x)]. 



(28) 



The primordial Gaussian field xo is assumed to be sta- 
tistically homogeneous. Similar modulations have been 
considered before [19]. We consider the modulating field 
^(x) to be fixed, so the aim is to reconstruct the large- 
scale (/) field by looking at the induced modulation of 
smaller-scale perturbations. To leading order in <p the 
primordial covariance is given by 

(x(k)x(kO) = (2^)^(5(k + kOPx(fe) 

+ J d^xe-^(^+^'>^(^(x) [P^{k) + P^{k')] . (29) 

Note that the modulated field (with fixed (j)) is no longer 
statistically homogeneous. Expanding the exponentials 
using 



(30) 



Im 



and using Eq. (19) to relate the primordial perturbations 
to the observed temperature multipoles, the covariance 
is then 

+ j d^^(j){^)ai^ {x)(3i^ (a;)Yitmi (x)Yi2™2 (x) 

+ / dMi^)aUx)f3i,ix)Yil^^i±)Yi,^,i±), (31) 



11 




FIG. 9: Effect of beam asymmetry on the reconstruction 
of the ^(fi) quadrupole with a{k) = 1, for the WMAP 
foreground-reduced data with KQ85 masking. SoUd black is 
the distribution with five degrees of freedom. Gray his- 
tograms are the distribution of values for the isotropic sim- 
ulations used to determine the estimator normalization for 
each D/A. Black vertical dashed lines indicate for the ac- 
tual data. Red histograms are for the beam- convolved maps 
produced by Ref. [JO]. Black dotted histogram are for the in- 
put maps to the convolution, using symmetric beam transfer 
functions and separate noise realizations. 
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FIG. 10: The a and /3 functions for adiabatic transfer func- 
tions evaluated at r* = 14164Mpc, the comoving distance to 
the maximum visibility at the last scattering surface for our 
fiducial CMB parameters. 



where 



27r2 



ai{r) = Att J d\n kji{kr) 
/3i{r) = 47r y d\n kji{kr)Ai{k)P^{k). (32) 

Hence using Eq ([9| there is a QML estimator for the 
modulating field ^ with 



/- 



Jimi 



1 mi 



.12^2 



■ (33) 



This allows us to reconstruct a map of the modulation at 
any desired radius. The construction is very similar to 
that used when estimating a local bispectrum ^91 |4T] . 
where the main quantity of interest is the correlation 
of the reconstructed (j) with an estimator of x (see Ap- 
pendix [B] for further discussion). By looking at the mod- 
ulating field directly, we can also consider the case where 
(j) and X are uncorrelated, and hence there is no leading- 
order bispectrum. 

The isotropic component of the normalization is given 

by 



E 

ll,l2 



(2/i + l)(2/2 + l)^/ h hVioiiJi.^Pi.ai,)^ 




Stt 



h h 



(34) 



and typical a and /3 functions are shown for adiabatic 
modes in Fig. [To) 
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Integrating hf^{r) over a window function W{r) is 
equivalent to estimating a modulating field with that 
separable radial dependence; for large scale modulations 
with a constant W{r) this is close to the spatial modu- 
lation estimator considered in Sec. IV A| since any large- 
scale primordial modulation modulates essentially all of 
the CMB anisotropy sources in that direction (except 
possibly low redshift sources due to ISW that only effect 
low /). Mathematically this is because 



/ 



r'^drai^{r)l3i^{r) ~ Ci^ 



(35) 



for li close to I2 (with equality for li = I2). We there- 
fore do not repeat the analysis here, since the large-scale 
results of Sec. |IV A| with the full range of / can be in- 
terpreted as being approximately a constraint on large- 
scale primordial modulations. We showed that any total 
modulation must be < 1% to be consistent the data on 
all scales, so modulation of primordial adiabatic modes 
would not explain the marginal evidence for a (9(10%) 
modulation at / < 100. 

We can also consider the case where there is a modu- 
lated primordial isocurvature mode, combined with the 
usual unmodulated adiabatic mode. For example the 
modulation could be due to a large-scale perturbation 
in the background curvaton field [42 . The off-diagonal 
covariance in general depends on the power spectrum of 
the isocurvature modes as well as the correlation with 
the adiabatic modes, and the isotropic power spectrum 
is also modified due to the extra contributions (which 
gives a constraint on the amplitude of the isocurvature 
contributions). Estimators for the modulating field are of 
the same form as Eq. ( 32 ) but with the appropriate com- 



binations of isocurvature transfer functions and power 
spectra: either a and P both calculated with isocurva- 
ture transfer function, or the cross-estimator 



E 

.hmi 



E 

iimi 



(r)0^-j^772i-^imi 



E 

E A': 

.^2^2 



f^h^ (r) 0^27712 ^^2^2 



■(r)e 



Z2W-2 -^2^2 



(36) 



This corresponds to reconstructing the modulation of a 
primordial perturbation xo + ^-S*, where S is taken to be 
a CDM isocurvature perturbation correlated to xo- We 
assume the primordial adiabatic, isocurvature and cross 
power spectra are proportional, so up to normalization a 
and P differ only in the transfer function used. 

In Fig [11] we plot the pseudo-C^ power spectrum of 
(j)im reconstructed from the WMAP data for these mod- 
els. Large scale modulations should be approximately 
constant across the depth of the last-scattering surface, 
and so we simply evaluate the estimator at the radius of 
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FIG. 11: Pseudo-Ci of the (l)im reconstructions for the WMAP 
V-band foreground-reduced data, with KQ85 mask and 
/max = 400. The full-sky, isotropic normalization appropri- 
ate to each estimator was used rather than the actual inverse 
Fisher matrix. The [25,75]% (dark gray) and [5,95]% (light 
gray) confidence intervals measured from Monte- Carlo simu- 
lations are overlaid. The estimates are scaled by Ar[r*]^ 
to give a dimensionless constraint, with = lOOMpc, the 
approximate width of the last scattering surface. 



peak visibility where most of the small-scale CMB fluctu- 
ations are coming from. The estimators at a given r are 
significantly more noisy than the sky modulation estima- 
tors since the latter effectively average the result from 
many radial shells. We see no deviations from isotropy 
at the peak of the visibility. 

As explained above, doing a constant line-of-sight in- 
tegral is almost equivalent to a sky modulation for large- 
scale modulations. The same applies for isocurvature 
mode contributions (with greater accuracy since there is 
less contribution from ISW at low redshift), for example 
where the sky has adiabatic and isocurvature contribu- 
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tions: 



1.0 



e(n) = e^^(n) + a[l + /(n)]e^=°(n). 



(37) 



The isocurvature power spectrum cj^^~^^^ falls off on 
sub-horizon scales, so the data is consistent with an 
(9(10%) dipole modulation of the total temperature if 
the modulation is only in the isocurvature component. 
The isotropic power spectrum constrains the isocurva- 
ture component to be subdominant however, so this 
would mean that an (9(1) modulation of the isocurvature 
component is required, which would invalidate the as- 
sumptions in deriving the sky modulation estimator that 
I /I <C 1. However we can still apply a null- hypothesis 
test on a purely isotropic adiabatic model using what- 
ever parameterization we like, though interpretation of 
any detection of deviation may need to be treated with 
care. We do not attempt a fully consistent analysis of 
isotropic and anisotropic constraints on isocurvature con- 
tributions here. 

The isocurvature mode could also be correlated with 
the adiabatic mode, in which case we can constrain a 
model of the form 



e(n) = e^'^(n) + /(n)e^^°(n) 



Iso / 



(38) 



-yAd — Iso 



a depends on the ampli- 



where (ef^X*) = o^C] 

tude and degree of correlation, and Cf^^ is the spec- 
trum for totally correlated modes with the same initial 
amplitude {S = — Xo)- As usual we neglect terms of 
0{f'^). The monopole part of / allows for an isotropic 
isocurvature contribution. The estimators are almost the 
same as in the total temperature modulation case, with 



Im 



iimi 



.l2m2 



X 



(39) 

where is the relevant power spectrum, and we evalu- 
ate &im using the standard best-fit theory model in the 
assumption that a model with isocurvature modes must 
have about the same spectrum as the null hypothesis with 
no isocurvature modes. A more general model could also 
allow part of the adiabatic mode to be modulated. 

The constraint on isocurvature modulations is shown 
in Fig.[T2j though the data is still consistent with isotropy 
at the 1%-level. See Ref. [42] for more detailed discussion 
and possible physical mechanisms for generating modu- 
lated isocurvature modes. 

We have focussed on local modulations here, however 
the formalism can in principle be applied to more general 
primordial anisotropic models, for instance with 



X(x) = Xo(x) + 
d^ki d^k2 
(27r)3 (27r)3 



/ 



/(ki,k2)xo(ki)<^(k2)e^(''^+'^=)-^ (40) 



Such more general forms can arise for example in warm 
inflation |43J and other models giving non-local primor- 
dial bispectrum non-Gaussianity (see e.g. Ref. ^44^). A 
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FIG. 12: Same as Fig. [3] but using the power spectra for the 
indicated combinations of adiabatic and isocurvatu re t ransfer 
functions in the estimator (replacing Ci^ in Eq. (17)). All 
curves are for the WMAP V-band foreground-reduced data 
with KQ85 masking. 



detailed analysis is beyond the scope of this paper, but 
is similar to that required to constrain the bispectrum 
when (j) is equal to xo- 



V. CONCLUSIONS 

The assumption that the GMB is statistically isotropic 
is a common one in modern cosmology, and needs to be 
rigorously tested. In this paper, we argued that QML 
estimators are often ideal for this purpose. They pro- 
vide optimal, minimum-variance constraints and permit 
straightforward systematic and jacknife tests. We have 
demonstrated their use in application to the WMAP 
dataset for constraining several anisotropic models. Most 
notably, we reproduced the results of previous exact like- 
lihood analyses which showed marginal evidence for a 
dipolar modulation of the observed CMB for / < 100. 
However we showed that the amplitude of the modu- 
lation must fall significantly on smaller scales, consis- 
tent with observations that the small scale power in the 
universe (e.g. probed by quasars 05]) appears to be 
highly isotropic. This scale-dependence is inconsistent 
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with modulated adiabatic primordial modes, but could 
possibly be explained by modulated isocurvature modes, 
which have less power on small scales. However evidence 
for deviations from isotropy are only at the 1%-level. 

We also studied a model in which the primordial power 
spectrum has a direction dependence, detecting a signifi- 
cant anisotropy in the WMAP maps with close-to-ecliptic 
alignment. However we argued that this signal has an im- 
portant contribution from beam asymmetries, which are 
uncorrected in the maps, and that significant variations 
between differencing assemblies indicate that most of the 
signal is non-primordial. 

An important extension of this approach is the inclu- 
sion of polarization data. For anisotropics which are de- 
tected in temperature at marginal significance, polariza- 
tion can help to increase the signal to noise and to dis- 
tinguish possible primordial, local and systematic origins 
of any detection ^19j . 
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Appendix A: Anisotropic Simulations 

Here we present an algorithm for generating weakly 
anisotropic simulations^. We wish to sample from a 
Gaussian distribution with covariance 



C = [I ^ 5C[C']-^)0 



(Al) 



where C" is isotropic and 5C is the small anisotropic 
part. Then 



0= [I + 5C[&]-^f'^ 



(A2) 



where 0* has covariance will have the required co- 
variance. For small anisotropy this can be expanded to 



give 



~ 0^ + ]^3C[C']-^& + . . . . (A3) 



^ Thanks to Anthony Chalhnor for suggesting this presentation. 



Higher order terms can be included if required; here we 
just calculate the leading anisotropic term. In the case 
of an anisotropic primordial power spectrum this is 
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Cii.e 



Ci. 



(A4) 



Appendix B: Relation to bispectrum estimators 

Power anisotropy is closely related to primordial non- 
Gaussianity. Local non-Gaussianity gives power in 
squeezed triangles: large scale perturbations are corre- 
lated to small scale power over their extent. If the large 
scale perturbation were unknown, this would look just 
like a small-scale power anisotropy. Hence a method that 
reconstructs the power anisotropy should be sensitive to 
three-point non-Gaussianity, in that the correlation of 
the anisotropy estimator with the temperature is a probe 
of the bispectrum. Non-Gaussianity can also give rise 
to a dipole power asymmetry correlated with the (un- 
known) cosmological dipole. In the inflationary context 
a power anisotropy on a given scale typically indicates 
the impact of those modes when they were outside the 
horizon on the generation of smaller fluctuations, e.g. via 
the local change in the effective background seen by the 
smaller modes as they leave the horizon. Unfortunately 
the large variance of the anisotropy estimators means 
that for small amounts of non-Gaussianity any detection 
with standard shapes is expected to be via the correla- 
tion to the temperature, as in standard bispectrum anal- 
yses. However since there is marginal evidence for power 
anisotropics in the data it is worth considering whether 
these could be associated with some form of physically- 
motivated primordial non-Gaussianity. There may also 
be marginal observational evidence for local primordial 
bispectrum non-Gaussianity j46ti48] . so conceivably there 
could be a joint explanation. 

A statistically-isotropic and parity-invariant CMB bis- 
pectrum Bi^i^i^ is defined by 



{(^Iimi(^l2m2^l3ms) — -^^i^2^3 



^1 h h 
mi m2 ms 



(Bl) 



= ^hhh J dl^^^imi^^2m2^^3m37 (B2) 

where bi-^^i^i^ is the reduced bispectrum. When the 
CMB power spectrum is known, the optimal estimator 
for weakly non- Gaussian fields with bispectrum Bi^i^i^ 
is [391 Hal EQ] 



^^iTTli ^^2''^2 ^^3^-3 



^^^imi^2m2®^3m3 



, (B3) 
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where is a normalization. Following Ref. [39] this can 
be written as 

^ = Ws^ - 3(X,™)) (B4) 



Im 



where 



(B5) 



The field is of the form of the most general quadratic 
anisotropy estimator, with free weighting coefficients 
bii^i^. Bispectrum estimators are essentially correlations 
of specific power anisotropy estimators with the temper- 
ature. 

Often the dipole &im is projected out of bispectrum 
analyses, effectively ignoring Xi^, even though in gen- 
eral Xim contains interesting power asymmetry informa- 
tion that could be generated by non-Gaussianity. The 
'optimal' bispectrum estimator is only optimal with cer- 
tain assumptions; if the temperature dipole information 
is removed then the dipole part of the power anisotropy 
should contain additional information about the non- 
Gaussianity, which in principle could improve constraints 
if the non-Gaussianity is larger than the anisotropy ex- 
pected from lensing and doppler effects. This is equiva- 
lent to using part of the trispectrum, see Refs. [5TJ [52] 
for further discussion. 

A primordial statistically isotropic bispectrum can be 
defined as 

(Xo(ki)xo(k2)xo(k3)> = {2TrfS{ki+k2+k3)B^{ki,k2,h). 

(B6) 



For local non-Gaussianity 



B^{ki,k2, fcs) = ±2-fNL{Px{ki)Px{k2) + 2 perms.) 

(B7) 

where the 3/5 is conventional (from relation the curva- 
ture perturbation to the Newtonian potential in the ra- 
diation dominated era) and ± is a sign convention. The 
reduced bispectrum is then 

bhhh=^^fNL J r^drA,(r)A2(r)a/3(r) + 5 perms. 

(B8) 

We can relate this to our local spatial modulation es- 
timator of Eq. [33] by windowing to give a line-of-sight 
average of the primordial modulating field 



drr'hf^{r)Wi{r) 



X "^bihh 

hh 



■ (B9) 



This is of the form of a general quadratic anisotropy es- 
timator, but with weights in the specific form 



hh = j <^rr'^Wi{r)ai,{r)(3i^{r). 



(BIO) 



If the modulating field is the primordial field itself, 
^ = Xo, there is local bispectrum. In this case choosing 
Wi{r) = Pi{r) relates the primordial field to the observed 
temperature, since l3{r)Qim is the minimum variance es- 
timate for the primordial field at r [41]. Then hu^i^ is the 
form of the reduced bispectrum obtained by correlating 
him with the temperature Qim- 
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